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Resume. Le modele a blocs stochastiques (SBM) decrit les interactions entre les som- 
mets d’un graphe selon une approche probabiliste, basee sur des classes latentes. SBM fait 
I’liypothese implicite que le graphe est stationnaire. Par consequent, les interactions entre 
deux classes sont supposees avoir la meme intensite pendant toute la periode d’activite. 
Pour relaxer I’hypothese de stationnarite, nous proposons une partition de I’horizon tem- 
porel en sous intervalles disjoints, chacun de meme longueur. Ensuite, nous proposons 
une extension de SBM qui nous permet de classer en meme temps les sommets du graphe 
et les intervalles de temps on les interactions ont lieu. Le nombre de classes latentes (K 
pour les sommets, D pour les intervalles de temps) est enhn obtenu a travers la max¬ 
imisation de la vraisemblance integree des donnees completees (ICL exacte). Apres avoir 
teste le modele sur des donnees simulees, nous traitons un cas reel. Pendant une journee, 
les interactions parmi les participants de la conference HCM Hypertext (Turin, 29 Juin - 
ler Juillet 2009) ont ete traitees. Notre methodologie nous a permis d’obtenir une clas¬ 
sifications interessante des 24 heures: les moments de rencontre tels que les pauses cafe 
on buffets ont bien ete detectes. La complexite de I’algorithme de recherche, lineaire en 
fonction du nombre initial de clusters {K^ax et D^ax respectivement), nous oriente vers 
I’utilisation d’instruments avances de classihcation, pour reduire le nombre attendu de 
classes latentes et ainsi pouvoir utiliser le modele pour des reseaux de grand dimension. 

Mots-cles. Graphes aleatoires, classihcation temporelle, modeles a blocs stochas¬ 
tiques, vraisemblance classihante integree 


Abstract. In this paper, we focus on the stochastic block model (SBM), a probabilis¬ 
tic tool describing interactions between nodes of a network using latent clusters. The SBM 
assumes that the network has a stationary structure, in which connections of time varying 
intensity are not taken into account. In other words, interactions between two groups are 
forced to have the same features during the whole observation time. To overcome this 
limitation, we propose a partition of the whole time horizon, in which interactions are 
observed, and develop a non stationary extension of the SBM, allowing to simultaneously 
cluster the nodes in a network along with hxed time intervals in which the interactions 
take place. The number of clusters (K for nodes, D for time intervals) as well as the class 
memberships are hnally obtained through maximizing the complete-data integrated likeli¬ 
hood by means of a greedy search approach. After showing that the model works properly 
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with simulated data, we focus on a real data set. We thus consider the three days ACM 
Hypertext conference held in Turin, June 29th - July 1st 2009. Proximity interactions 
between attendees during the hrst day are modelled and an interesting clustering of the 
daily hours is hnally obtained, with times of social gathering (e.g. coffee breaks) recovered 
by the approach. Applications to large networks are limited due to the computational 
complexity of the greedy search which is dominated by the number Kmax and Dmax of 
clusters used in the initialization. Therefore, advanced clustering tools are considered to 
reduce the number of clusters expected in the data, making the greedy search applicable 
to large networks. 

Keywords. Random graphs, time event clustering, stochastic block models, inte¬ 
grated classihcation likelihood. 

1 Introduction 

Since the interactions between nodes of a network generally have a time varying intensity, 
the network has a non trivial time structure that we wish to infer. An example of this 
complexity can be observed in Figure Q. On the vertical axis the aggregated number of 
proximity face-to-face interactions (less than 1.5 meter) between attendees of the HCM 
Hypertext conference (Turin, June 29th - July 1st, 2009) is given. On the horizontal axis, 
a time line is reported, corresponding to the 24 hours of the hrst day of conference. The 
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Figure 1: Aggregated number of interactions between conference attendees per quarter-hour. 

day was partitioned in small time intervals of 20 seconds in the original data frame. We 
considered 15 minutes time aggregations leading to a partition of the day made of 96 
consecutive quarter-hours: each red point in the hgure corresponds to one of them. If we 
associate each attendee to a node and each interaction to an edge, the non stationarity 
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of such a graph is clear. In the following section, after describing the general stochastic 
block model, we illustrate the temporal evolution we propose. 

2 A non stationary stochastic block model 

We present here the SBM (Holland et ah [2j), using the same notations as in Wyse, 
Friel, Latouche (2014). The set of nodes A = {ai,...,aAr} is introduced. Undirected 
links between nodes i and j from A are counted by the observed variable Xij, being the 
component (i,j) of the N x N adjacency matrix X = Nodes in A are 

clustered in K disjointed subgroups respectively: 

A = Uk<KAk, n = 0, yi ^ g 

Nodes in the same cluster in A have linking attributes of the same nature. We introduce 
an hidden vector c = {ci,..., cn} labelling each node’s membership: 

Ci = k iff tti E Ak, Vfc < K. 

In order to introduce a temporal dimension, consider now a sequence of equally spaced, 
adjacent time steps {A^ := tu—tu-i}u<u over the interval [0, T] and a partition Ci,..., Cd 
of the same intervaQ We introduce furthermore a random vector y = }«<{/, such that 

Hu = d ii and only if 1^ '■=\tu-i)tf\ G Cd,'^d < D. We attach to y a multinomial 
distribution: 

p(y|/3.-D) = If/^T'. 

d<D 

where \Cd\ = : lu G Cd}- Now we dehne N-f as the number of observed connections 

between a* and aj, in the time interval and we make the following crucial assumption: 

p{N-f\ci = k,Cj = g,yu = d) follows a Poisson(A„Afcgrf), (1) 

hence the number of interactions is conditionally distributed like a Poisson random vari¬ 
able with parameter depending on k,g,d (A^ is constant). 

Notation: In the following, for seek of simplicity, we will note: 

n = n n n n = n 

k,g,d k<K g<K d<D ci i:ci=k 

and similarly for and Y\y^- 

and U are linked by the following relation: T = AuU. 
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The adjacency matrix, noted N^, has three dimensions (iV x N xU) and its observed 
likelihood can be computed explicitly: 


p{N^\A,c,y,K,D)=l[ 


/\^kgd 


k,g,d ^ 


Ho, n., n,. 


c,-^^kgdRkgd \^kgd 
^ ^kgd ’ 


( 2 ) 


The subscript u was 
or every u. 


where we noted Skgd ■= Ec, Ec, Ey„ and Rkgd :=\Ak\\Ag\\Cd 
removed from to emphasize that time steps are equally spacer' 

Since c and y are not known, a multinomial factorizing probability density p(c, y |<h, K, D), 
depending on a hyperparameter $, is introduced. The joint distribution of labels looks 
hnally as follows: 


pic,y\^,K,D)= n 


u 


l^fel 


Kk<K 


\d<D 


\Cd\ 

d 


(3) 


where <h = {u), /3}. 

2.1 Exact ICL for non stationary SBM 

The integrated classihcation criterion (ICL) was introduced as a model selection criterion 
in the context of Gaussian mixture models by Biernacky et al. [3]. Come and Latouche 
[B] proposed an exact version of the ICL based on a Bayesian approach for the stochastic 
block model and this is the approach we follow here. The quantity we focus on is the 
complete data log-likelihood, integrated with respect to the model parameters $ and 
A = {Xkgd}k<K,g<K,d<D- 

ICC = log (^jp{N^, c,y, A, D)dAd<!>^ . (4) 

Introducing a prior distribution i>{A,^\K, D) over the pair 4), A and thanks to ad hoc 
independence assumptions, the ICL can be rewritten as follows: 

ICC = log (j/(iV^|c, y, K, D)) + log (z/(c, y\K, D )). (5) 

The choice of prior distributions over the model parameters is crucial to have an explicit 
form of the ICL. 

^Self loops are considered here for seek of simplicity. The model can easily be extended to graphs with 
undirected links and/or no self loops. 
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2.2 A priori distributions 

We consider the conjugate prior distributions. Thus we impose a Gamma a priori over A: 

and a factorizing Dirichlet a priori distribution to <h: 

z/($|A', D) = Dir^(aj; a,...,a)x Dir£,(/3; 7 ,..., 7 ). 

It can be proven that the two terms in ([^, reduce to: 

/\Skgd 


'(Af^|c,y,ii',C) = n 


r(«) n.. Ho, n.., 


and: 


u{c,y\K,D) = 


k,g,d '' ^ i ICi 1 i-Cj 1 i-y-a ij 

^{Skgd + fl) 

[ARkgd + 6 ]^'=^'^+“ 


r{aK) Uk<K rd^fcl + «) r(7D) Ud<D r(|C'd + 7) 


r(a)^ T{N + aK) 


X 


r( 7 )^ T{U + -fD) 


( 6 ) 


(7) 


3 ICL maximization and experiments 

In order to maximize the integrated complete likelihood (ICL) in equation ([^ with respect 
to the four unknowns c,y, K,D, we rely on a greedy search over labels and the number 
of nodes and time clusters. This approach is described in Wyse, Trial and Latouche [1] 
for a stationary latent block model. 


4 Experiments 

4.1 Simulated data 

Some experiments on simulated data have initially been conducted. Based on the model 
described above, we simulated interactions between 50 nodes, clustered in three groups 
{K = 3). Interactions take place into 24 time intervals of unitary length (ideally one 
hour), clustered into three groups too {D = 3). Nodes and time intervals labels are 
sampled from multinomial distributions. With these settings, we consider 27 different 
Poisson parameters (As) generating connections between nodes. The generative model 
used to produce them is described by: 

Xkgi = Si[A;] + 52 ( 5 '] + Sa)/], k, g,l G {1, 2, 3} 
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where: 


Si = [0,2,4] S 2 = [0.5,1,1.5] S 3 = [0.5,1,1.5] 

ans Si[fc] denotes the k-th component of Si. Similarly for S 2 and S 3 . The greedy search 
algorithm we coded was able to exactly recover these initial settings, converging to the 
true ICL of —122410. Other experiments were run with different values inside vectors 
si, S2, S3. Not surprisingly the more nuanced differences between As are, the more difficult 
it is for the algorithm to converge to the true value of the IClj^ 

4.2 Real Data 

The dataset we used was collected during the ACM Hypertext conference held in Turin, 
June 29th - July 1st, 2009. Conference attendees volunteered to to wear radio badges that 
monitored their face-to-face proximity. The dataset represents the dynamical network of 
face-to-face proximity of 113 conference attendees over about 2.5 day^ Further details 
can be found in Isella, Stehle, Barrat, Cattuto, Pinton, Van den Broeck [7|. We focused 
on the hrst conference day, namely the twenty four hours going from Sam of June 29th 
to 7.59am of June 30th. The day was partitioned in small time intervals of 20 seconds 
in the original data frame. We considered 15 minutes time aggregations, thus leading 
to a partition of the day made of 96 consecutive quarter-hours {U = 96 with previous 
notation). A typical row of the adjacency matrix we analyzed, looks like: 


Person 1 

Person 2 

Time Interval (15m) 

Number of interactions 

52 

26 

5 

16 


It means that conference attendees 52 and 26, between 9am and 9.15am have spoken for 
16 X 20s ~ 5m30s. 

The greedy search algorithm converged to a final ICL of -53217.4, corresponding to 23 
clusters for nodes (people) and 3 time clusters. In Figure (2a) we show how daily quarter- 
hours are assigned to each cluster: the class Ci contains intervals marked by a weaker 
intensity of interactions (on average), whereas intervals inside C 3 are characterized by 
the highest intensity of interactions. This can either be verihed analytically by averaging 
estimated Poisson intensities for each one of the three clusters or graphically by looking 
at Figure (2b). In this Figure we computed the total number of interactions between 
conference attendees for each quarter-hour and it can clearly be seen how time intervals 
corresponding to the higher number of interactions have been placed in cluster C^, those 
corresponding to an intermediate interaction intensity, in C 2 and so on. It is interesting 


^Greedy search algorithms are path dependent and they could converge to local maxima. 
^More informations can be found at: 

http://www.sociopatterns.org/datasets/hypertext-2009-dynamic-contact-network/ 
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(a) Clustered time intervals. 


(b) Connections for every time interval. 


Figure 2: The aggregated connections for every time interval (2b) and time clusters found by 
our model (2a) are compared. 


to remark how the model can quite closely recover times of social gathering like the lunch 
break (13.00-15.00) or the “wine and cheese reception” (18.00-19.00). A complete program 
of the day can be found at: 

http://WWW.ht2009.org/program.php, 


5 Conclusions 

We proposed a non-st at ionary evolution of the latent block model (LBM) allowing us to 
simultaneously infer the time structure of a bipartite network and cluster the two node 
sets. The approach we chose consists in partitioning the entire time horizon in hxed- 
length time intervals to be clustered on the basis of the intensity of connections in each 
interval. We derived the complete ICL for such a model and maximized it numerically, 
by means of a greedy search, for two different networks: a simulated and a real one. The 
results of these two tests highlight the capacity of the model to capture non-stationary 
time structures. 
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